Measuring the disclosure protection of micro aggregated business microdata: An analysis taking the example of the German Structure of Costs Survey
Abstract
To assess the effectiveness of an anonymisation method with respect to data protection, the disclosure risk associated with the protected data must be evaluated. We consider the scenario in which a potential data intruder matches an external database against the entire set of confidential data. To improve the external database, the intruder tries to assign as many correct pairs of records (that is, records referring to the same underlying statistical unit) as possible. The problem of maximising the number of correctly assigned pairs is translated into a multi-objective linear assignment problem (MOLP). For several variants of the micro aggregation anonymisation method applied to the German Structure of Costs Survey, we calculate approximate solutions to the MOLP, using two external databases as the data intruder's additional knowledge. Finally, a standard for so-called de facto anonymity is suggested.
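The matching scenario described in the abstract can be illustrated with a small sketch. It is not the paper's procedure: the toy records, the variable weights, and the function name `match_records` are all hypothetical, the several objectives are collapsed into a single weighted sum of per-variable distances, and the assignment is found by brute-force enumeration, which is feasible only for a handful of records.

```python
from itertools import permutations

def match_records(external, protected, weights):
    """Assign each external record to one protected record, one-to-one,
    minimising the total weighted absolute distance (brute force)."""
    n = len(external)

    def cost(i, j):
        # Weighted sum over variables: a simple scalarisation (assumption).
        return sum(w * abs(a - b)
                   for w, a, b in zip(weights, external[i], protected[j]))

    best_perm, best_cost = None, float("inf")
    for perm in permutations(range(n)):
        total = sum(cost(i, perm[i]) for i in range(n))
        if total < best_cost:
            best_perm, best_cost = perm, total
    return list(best_perm), best_cost

# Hypothetical toy data: (employees, turnover) per enterprise.
external = [(12, 110), (48, 410), (95, 880)]   # intruder's database
protected = [(50, 400), (90, 900), (10, 100)]  # anonymised, shuffled file
true_match = [2, 0, 1]                         # known only to the data holder

assignment, total_cost = match_records(external, protected, weights=(10, 1))
correct = sum(a == t for a, t in zip(assignment, true_match))
print(assignment, total_cost, correct / len(true_match))
```

The share of correctly assigned pairs in the last line is the kind of quantity a disclosure-risk assessment would examine. For files of realistic size, a polynomial-time assignment algorithm or a heuristic would be needed instead of enumeration.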
Similar resources
Anonymization of statistical data
In the modern digital society, personal information about individuals can be collected, stored, shared, and disseminated much more easily and freely. Such data can be released in macrodata form, reporting aggregated information, or in microdata form, reporting specific information on individual respondents. Protecting data against improper disclosure is then becoming critical to ensure proper pr...
Empirical evidences on protecting populations
This paper describes the process of statistical disclosure analysis and control applied by the Statistical Institute of Catalonia (Idescat) to microdata samples from censuses and surveys with some population uniques. Since 1995, by means of models which allow calculation of the risk and data protection procedures, some empirical evidence has been obtained in order to check the performance of -ARGU...
Microdata Protection
Governmental, public, and private organizations are more and more frequently required to make data available for external release in a selective and secure fashion. Most data are today released in the form of microdata, reporting information on individual respondents. The protection of microdata against improper disclosure is therefore an issue that has become increasingly important and will co...
Statistical disclosure control in tabular data
Data disseminated by National Statistical Agencies (NSAs) can be classified as either microdata or tabular data. Tabular data is obtained from microdata by crossing one or more categorical variables. Although cell tables provide aggregated information, they also need to be protected. This chapter is a short introduction to tabular data protection. It contains three main sections. The first one ...
ACRONYM: Data without Boundaries D
Disclosure limitation methods for protecting the confidentiality of respondents in survey microdata often use perturbative techniques which introduce measurement error into the categorical identifying variables. In addition, the data itself will often have measurement errors commonly arising from survey processes. There is a need for valid and practical ways to assess the protect...